Efficient data mappings for parity-declustered data layouts
نویسندگان
چکیده
The joint demands of high performance and fault tolerance in a large array of disks can be satisfied by a parity-declustered data layout – an arrangement of data and redundant information that allows the rapid reconstruction of lost data while the array continues to operate. A data layout is typically generated by partitioning the data units on the disks into stripes and choosing one or more units per stripe to hold redundant information. Such a data layout can be represented as a table of stripes. The data mapping problem is the problem of translating a data address in a linear address space (the file system’s view) into a disk identifier and an offset on the disk where the data is stored. Typically, the disk and offset are obtained from the data layout using table lookups, but recent work has yielded mappings that compute (disk, offset) pairs directly from data addresses without the need to store tables. In this paper, we show that parity-declustered data layouts based on commutative rings yield mappings with improved computational efficiency. These layouts also apply to a wider range of array configurations than other known layouts that do not use table lookup.
منابع مشابه
Permutation Development Data Layout (PDDL)
Declustered data organizations in disk arrays (RAIDs) achieve less-intrusive reconstruction of data after a disk failure. We present PDDL, a new data layout for declustered disk arrays. PDDL layouts exist for a large variety of disk array configurations with a distributed spare disk. PDDL declustered disk arrays have excellent run-time performance under light and heavy workloads. PDDL maximizes...
متن کاملPermutation Development Data Layout (PDDL) Disk Array Declustering
Declustered data organizations have been proposed to achieve less-intrusive reconstruction of a failed disk’s contents. In previous work, Holland and Gibson identified six desirable properties for ideal layouts. Ideal layouts exist for a very limited family of configurations. The PRIME data layout deviates from the stated ideal only slightly and its run-time performance is very good for light w...
متن کاملAn Example of Data Dependence Result for The Class of Almost Contraction Mappings
In the present paper, we show that $S^*$ iteration method can be used to approximate fixed point of almost contraction mappings. Furthermore, we prove that this iteration method is equivalent to CR iteration method and it produces a slow convergence rate compared to the CR iteration method for the class of almost contraction mappings. We also present table and graphic to support this result. F...
متن کاملCompound Treatment of Chained Declustered Replicas Using a Parallel Btree for High Scalability and Availability
Scalability and availability are key features of parallel database systems. To realize scalability, many dynamic load-balancing methods with data placement and parallel index structures on sharednothing parallel infrastructure have been proposed. Data migration with range-partitioned placement using a parallel Btree is one solution. The combination of range partitioning and chained declustered ...
متن کاملSpectral Characterization of Iterating Lossy Mappings
In this paper we study what happens to sets when we iteratively apply lossy (round) mappings to them. We describe the information loss as imbalances of parities of intermediate distributions and show that their evolution is governed by the correlation matrices of the mappings. At the macroscopic level we show that iterating lossy mappings results in an increase of a quantity we call total imbal...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Theor. Comput. Sci.
دوره 325 شماره
صفحات -
تاریخ انتشار 2004